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ABSTRACT: 



The contact process on an infinite homogeneous tree is shown to exhibit at least two phase 
transitions as the infection parameter A is varied. For small values of A a single infection 
eventually dies out. For larger A the infection lives forever with positive probability but 
eventually leaves any finite set. (The survival probability is a continuous function of A, 
and the proof of this is much easier than it is for the contact process on (i-dimensional 
integer lattices.) For still larger A the infection converges in distribution to a nontrivial 
invariant measure. For an ra-ary tree, with n large, the first of these transitions occurs 
when A ~ 1/n and the second occurs when l/2y/n < A < ej \fn. Nonhomogeneous trees 
whose vertices have degrees varying between 1 and n behave essentially as homogeneous 
n-ary trees, provided that vertices of degree n are not too rare. In particular, letting n 
go to oo, Galton- Watson trees whose vertices have degree n with probability that does 
not decrease exponentially with n may have both phase transitions occur together at 
A = 0. The nature of the second phase transition is not yet clear and several problems 
are mentioned in this regard. 
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1 Introduction 



This paper studies the contact process on trees. A tree is just a connected unoriented 
graph without cycles. A brief description of the contact process is as follows. The state 
space, in this case a tree, is any undirected graph. At any instant in time, some vertices 
of the graph are infected and some are healthy. The infected vertices recover (turn into 
healthy vertices) at poisson rate 1, independently. Each infected vertex independently 
infects each of its healthy neighbors at poisson rate A for some fixed parameter A. More 
complete descriptions of this are available in [Li] and [Du], along with a construction 
(which I will later describe) of the process from a poisson process called the graphical 
representation. 

There are practical and mathematical reasons to study the contact process on trees. 
The practical reason is that the process was developed to model the spread of infectious 
disease. The vertices of the graph represent individuals susceptible to the disease and 
the edges represent pairs of individuals who may have frequent contact. Traditionally, 
the graph is taken to be the n-dimensional integer lattice, probably because that makes 
sense for other interacting particle systems such as the Ising model for ferromagnetism. 
While the "real" graph is a large finite graph, a tree is at least as likely as a lattice to 
serve as a local approximation to the real graph for a population. 

One mathematical reason for studying the contact process on trees is to better under- 
stand the correspondence between behaviour of processes on trees and limits of behaviours 
on lattices as the dimension of the lattice goes to infinity. For other processes such as 
random walks and the Ising model, there is a critical dimension beyond which the lattice 
versions of the process behave in essential ways like the tree version. 

Another reason to study trees is the following observation about the contact process 
on different graphs. If G and H are graphs with a map <fi from the vertices of G to the 
vertices of H, such that (x, y) is an edge of G whenever (<f>x, <fyy) is an edge of H, then the 
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contact process may be coupled so that the number of infections on G is always at least 
the number of infections on H . (Letting infections and recoveries on G proceed as on H 
but subject to the restriction that there can be at most one infection in any <p^ 1 (x) gives 
a process with equal recovery rate but smaller infection rate than the contact process 
on G. This process has as many infections as the process on H and by monotonicity, 
has fewer infections than the contact process on G.) Since any graph has a tree for a 
universal cover, this relates the contact process on any graph to the contact process on a 
tree. In particular, among all graphs where every vertex has the same degree n + 1, the 
tree has the best survival probability for the contact process, hence the critical value Ai 
defined below is least for the tree. 

The main reason that the contact process on a tree is worth studying is that it exhibits 
a multiple phase transition. In this respect the behaviour differs remarkably from any 
of the usual interacting particle systems on the lattice. In fact a major recent result 
[BG] says that no such monkey business can happen with the contact process. That 
such behaviour may be the rule for trees, rather than the exception, is made plausible 
by the recent work of Grimmett, Newman and Wu [GN], [NW], which shows a similar 
phenomenon for percolation on a graph which is the integers crossed with a tree. Since 
the contact process on a graph G is very similar to oriented percolation on G x Z, it 
is not surprising that unoriented percolation on the integers crossed with a tree should 
behave like the contact process on a tree. 

The results in this paper are concerned with the existence and location of the multiple 
phase transitions. The theory of the contact process on a tree has by no means been 
tidied up. In particular, I believe but am far from being able to prove that there are 
only two phase transitions. The rest of this introductory section will define the phase 
transitions and delimit what I know about their existence. 

Let T be an infinite tree with a distinguished vertex p called the root. Let 2 T be 
the set of subsets of the vertices of T, and let A C 2 T be the finite subsets of vertices. 
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The contact process can be viewed as a random variable taking values in the space of 
functions £ with domain R + and range 2 T , where is the set of vertices infected at 
time t. If the initial configuration £(0) is almost surely in A, then stays in A almost 
surely for all time. I will not prove statements on this level, or even formally construct 
the contact process; the reader may easily do this from [Du] . The idea is to construct the 
graphical representation, which is merely a set of independent poisson processes. There is 
a rate 1 poisson process for each vertex giving the recovery times at that vertex (a vertex 
ignores recovery times while it is healthy) and there is a rate A poisson process for each 
oriented edge giving the attempted infection times, where an attempted infection from an 
infected node to a healthy node transmits the infection but all other attempted infections 
are ignored. It is also possible to construct the process directly from its semigroup [Li]. 
Here is some notation. 

For any configuration 7] G 2 T , use P J? (£(t) has property X) to denote probabilities for 
the contact process started from the initial condition £(0) = rj. Let P„ denote P{„} in 
this regard. A measure \x is invariant for the contact process if £(t) has law /i for any t 
whenever £(0) has law ji. Since the contact process is an attractive system, it is easily 
seen [Li] that there is a stochastically greatest invariant measure JL that can be gotten by 
taking the limit of the laws of £(£) as t — > oo when £(0) is the configuration with every 
vertex infected. (The contact process, since it is an attractive system, is monotone in the 
following sense. Two contact processes run from initial states where the infections of one 
are a subset of the infections of the other may be coupled so as always to remain in this 
relation. This and its consequences will be used freely in what follows, and sometimes 
without explicit mention.) The following are all possible definitions of critical values for 
the infection parameter A. 

Definitions 1.1 

Ai = inf {A : P p (£(t) ^ for all t) > 0} 
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= inf{A : 


: P p (p G £(t) for arbitrarily large t) > 0} 


A 6 


= inf{A : 


: lim sup P p (p G f (t)) > 0} 

t^oo 


A 2 


= inf{A : 


: lim MTP p {pem) >0} 


A c 


= inf{A : 


: £(t) conditioned on not being empty =>JI ^ 5 } 



where \i is the upper inveriant measure and So is the point mass at the empty set. 

In words: Ai is the value at which the process can survive forever with nonzero 
probability, above A a the root gets reinfected infinitely often, above A& the root is re- 
infected with probability bounded away from zero at arbitrarily large times, above A2 
these arbitrarily large times are all times, and above A c there is complete convergence to 
a nontrivial measure (which has been shown for one-dimensional, and more recently any 
dimensional integer lattice). It is immediate that Ai < A a < A& < A2 < A c < 00, with the 
last inequality following from the classical results one the contact process on the integers, 
which are a subgraph of any infinite tree. I believe the following relations hold among 
the critical values. 

conjecture 1 For any infinite tree, \ a = \ b = A 2 = A c . 

Call a vertex v of an infinite tree T esential if T\{-y} has at least two infinite components. 
Say that a tree has strongly exponential growth if there is some r > 1 such that for any 
v G T there are at least 3 essential vertices w whose graph-distance from v is r. 

conjecture 2 For any tree with strongly exponential growth, either all the above critical 
values are zero, or else Ai 7^ A a . 

The nature of the phase transitions other than Ai is evidently unclear, but regarding the 
first phase transition I believe: 
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conjecture 3 For any tree, the survival probability starting from a single infection is a 
continuous function of A . 

Conjecture 3 is undoubtedly difficult since it includes the case of the one-dimensional 
integer lattice, a recent and difficult result. Theorem 6.6 says that the survival probability 
is continuous whenever Ai ^ A a , thus Conjecture 2 implies Conjecture 3 for trees of 
strongly exponential growth. The reason some kind of growth condition is needed in 
Conjecture 2 is that it is easy to make both Ai and A a equal to the critical value for the 
integer lattice by making the tree contain indreasingly long stretches of vertices of degree 
two. The theorems in this paper make some progress toward proving these conjectures, 
separating Ai and A a , bounding A a and A2 near each other, and proving continuity of the 
survival probability, all in special cases. Here is a summary. 

Say a tree is homogeneous of degree n if every vertex has n + 1 neighbors. For 
homogeneous trees of degree n > 2, Theorem 2.2 states that < Ai < A a . Additionally, 
it gives reasonable bounds for the location of Ai, A a and A 2 . So there are at least two 
phase transitions when n > 2. 

Nonhomogeneous trees come in many shapes. This paper considers two varieties, 
namely periodic trees and Galton- Watson trees. Since there is no single parameter n 
measuring the size of such a tree (the mean growth of a Galton- Watson tree being an 
irrelevant statistic in this setting - Theorem 3.2), bounds on the critical values are harder 
to get at. The main result here is that = Ai = A a = A5 = A2 is possible. The last section 
deals with the continuity of the survival probability as a function of A. This is linked to 
the separation of Ai and A a , and to the question of whether survival is still possible if a 
single edge is severed. The relation is: Ai < A a =>- survival with a severed edge whenever 
survival is possible without a severed edge continuity of the survival probability in A. 
Thus again the process behaves nicely on homogeneous trees with n > 2. 
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2 Homogeneous trees 



Fix an integer n > 2. Let T be a homogeneous tree where each vertex (also called a 
node) has n + 1 neighbors. Pick any vertex p and call that the root; then each vertex 
other than p has one neighbor closer to p (the parent) and n neighbors (children) further 
from p. Define the contact process on T by letting each infected node recover at rate 
1 and each infected node infect each of its neighbors at rate A where A is any positive 
parameter. Denote by £(t) the set of infected nodes at time t > 0. Thus £ is viewed as a 
function from R + to 2 T , or sometimes as a function from R + x T to {0, 1}. Probabilities 
referring to initial state rj at time will be written as P v (£(t) • • •). When the initial state 
can be understood, the subscript will be dropped. 



Theorem 2.1 For all n > 2, < Ai < A2 < 00. For n = 2 the same holds except that 
the middle inequality may not be strict. 



This is a consequence of: 



Theorem 2.2 



i<A, 
n 



^9 + 16/(n - 1) - 1 1 
{11) Ai < — — < 



2(n+l) n-l 



/9 + 16/ (n- 1) - 1 2 /1\ 1/1 
Note that * P «_(_)+ ' 



2(n+ 1) 3 \nJ 3 \n - 1 



1 

(in) — = < X a 
v ; 2^ - 
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4 + 2/(Vn-l) -^8 + 16/(^-1) +4/(^-1)2 

H ^c7^) Aa 



4 - a/8 

iVoie t/iat t/ie Li/S' is asymptotically -=— and is always better than (in) 

(v) In particular, .561722, .425516, .354248 < A 2 when n is respectively 2, 3, 4 

4 

(vi) X2 < min(2, — = -) and lim sup \2\fn < e = 2.71828 . . . 

y/n — 4 rwoo 

(vii) K < max(A2 , WSTTT whm asymptottmlly 3W8JTT m 

2n — 2 2n — 2 a/u 

The upper bound on Ai in (ii) is less than the lower bound on A a on (Hi) except for 
n < 4. The reason it is worth improving the relatively easy bound (Hi) to the messy 
bound (iv) is to separate Ai and A a for n = 3,4, thus showing there are at least two 
phase transitions except possibly when n = 2. The rest of this section is taken up by the 
proof of Theorem 2.2. 

Proofs: 

(i) : See Liggett (1985) page 166. Alternatively, see the remark after the proof of (ii). 

□ 

(ii) : Assign a weight W(£) = ak(£)+bc(£) to each configuration £ having finitely many 
infected nodes, where a and b are constants to be determined later, k is the number of 
infected nodes of £, and c is the number of components in the induced subgraph of T on £, 



i.e. the graph gotten by restricting T to £. For A > (y9 + 16/(n — 1) — l)/2(n+l), it will 
be shown that W(g) is a submartingale, and in fact E(W(f (*+<**)) I f (*)) > (i+eW^*)) 
for a suitable e > 0. Together with the fact that W is a pure jump process with bounded 
jumps, this implies that the probability of W getting arbitrarily large before hitting 
zero is bounded below. (Proof: 1/W is a super martingale when W is greater than an 
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appropriate function of e and the bound on the jump size, so choose a starting time 
when W is large enough and stopping when W gets too small, the optional stopping 
theorem says the stopping time is reached with probability less than 1.) Here are some 
preliminary results. 

(A) The rate of new infections is precisely X((n — l)k + 2c). To see this, count all the 
attempted infections and subtract off the times a node attempts to infect an already 
infected node. The attempted infection rate is A(n + l)k and the failure rate is just A 
times twice the number of edges having infected nodes on both ends. But the number 
of such edges is the number of infected nodes minus the number of components they 
constitute, so the infection rate is A((n + l)k — 2{k — c)) = A(n — l)k + 2c. 

(B) The total recovery rate is precisely k. 

(C) The rate at which new components are formed is precisely k — 2c, if we agree to 
count the component lost by the recovery of an isolated infection as —1 new components. 
With this convention, a new component can only be formed by a recovery, not by a 
new infection, and conversely, components can only be lost by the joining of previously 
formed components along new infections. Consider any component of r infected nodes. 
It contains r — 1 connecting edges. A recovery at a node with j edges incident creates 
j — 1 new components (this covers the trivial case j = as well). Summing over all nodes 
gives J2(j — 1) = (J2 j) — r — 2r — 2 — r = r — 2, where J2 j — 2r — 2 because each edge 
is counted exactly twice. 

(D) The rate at which components are lost by joining is at most (n+l)A(c — 1) < 2Ac. To 
see this, consider a new, acyclic, bipartite graph, H, whose vertices are the components 
of the infection together with all joining vertices, i.e. vertices of the old tree that would 
join two components by becoming infected. The edges of H are all edges of the tree that 
connect a component of the infection to a joining vertex. Now, each joining vertex v, if 
infected, causes the loss of e(v ) — 1 components, where e(v) is the number of edges of H 
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incident to v. Since each joining vertex is infected at a rate of at most (n + 1)A, summing 
over all joining vertices gives a total component loss rate of at most (n+l)A Y^, v eH( e ( v )~ 1) 
which is just n + 1 times the total number of edge of H minus the total number of joining 
vertices, since each edge in the bipartite graph H gets counted exactly once. But the 
total number of edges in the acyclic graph H is just the number of vertices of H minus 
the number of components, which is at most c plus the number of joining vertices minus 
1. The number of joining vertices cancels out and the calculation is done. 

Now fix any A, and calculate whether the infinitesimal expected change in W is always 
positive. 

E(W(£(t+dt)) | f (*)) > W(£(t)) + {[((n - 1)A - l)k + 2\c]a + [k-2c-(n + l)X)c]b} dt. 

This is linear in c and k, so to check if it is positive for fixed A, a, b and every c and k, it 
suffices to check the extreme values for c, namely c = 1 and c = k. In fact it is easier to 
check c = instead of c = I. Then the two inequalities to be satisfied are 

((n-l)A-l)a + 6 > (1) 

((ra+l)A-l)a-(l + (n + l)A)6 > 0. (2) 

By (i), it is safe to assume (n + 1)A — 1 > and hence to change equation (2) into 
a > ((n + 1)A + 1)6/ ((n + 1)A — 1). Plugging this into (1) shows there will be a solution 
when 

((n-l)A-l)((n + l)A + l) 
((n+l)A-l) 

which happens iff (n + 1)A 2 + A — 2/(n — 1) >0 which, for positive A, will be true when 



A > (^9 + 16/(71-1) - l)/2(ra+l). 



Finally, it is easy to see that when this inequality holds, [E(W(£(i + dt)) \£(t)) — 
W(£(t))]/dt is at least a small constant multiple of bn for each fixed A, thus at least 
eW(£(t)) for an appropriate e > and the argument is done. □ 
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Remark: This method also gives a proof of (i). Instead of the upper bound on component 
loss in preliminary result (D), use a lower bound of zero. The same calculation now gives 
a = b/((n + 1)A — 1) and plugging that into equation (1) with the inequality reversed 
gives ((n — 1)A — l)/((n + 1)A — 1) + 1 > which is satisfiable when A < 1/n. Thus for 
A < 1/n, W can be defined to be a supermartingale and the weight must converge to 
zero. 

(Hi): This time use a weighting on all the vertices. It will be shown that the total 
weight of the vertices goes to zero even if their cardinality does not. Let the weight of a 
vertex v be W(v) = n~ d l 2 where d is the distance from p to v, i.e. the number of edges 
in the shortest path from p to v. Let 

M(t) = ^2 W(v)I vE £(t) = total weight at time t. 

V 

Then 

E(M(t + dt)\C(t)) 

= E(£l(vet(t + dt))W(v)\ffl) 

V 

< ^/(«e £(t + dt))W(v)(-l + An 1/2 + n\n- 1/2 ) 

V 

= M(t)+dtM(t)(2y/n\-l). 
So for A < l/2y / n, ~EM(t) goes to zero exponentially fast, hence so does P(p G £(t)). □ 

(iv) and (v): This time the idea is to take into account the coalescing nature of the 
infection in order to improve on the bound in (Hi). The weight of a vertex will depend 
on whether its parent is infected and will count less if so, due to the redundancy. Define 

W(v) = r k (l - ^/(parent of v G f (t))). (3) 
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where k is the distance from v to the root and r and d are parameters that will be 
chosen optimally later. The goal is to show that the expected change in the total weight 
^ T W(v)I{v G £(£)) is always negative. Let 9\{y) and 9 2 (v) denote respectively the 
number of infected parents of v (zero or one) and the number of infected children of v. 
Define 

u(v) = -1 + 9 1 (v)d + 9 2 (v)rd + Afr" 1 -d+(n- 9 2 (v))r(l - d)}. 

Then u{v)r k dt is an upper bound on the expected increment in the total weight from 
t to t + dt due to the recovery of v or infection of other nodes by v. (Breaks down as 
follows: recovery loses weight (1 — 9i(v))dr k , but gains 9 2 (v)dr k+1 ; infecting the parent 
gains at most r k ~ x while losing dr k , and infecting a child gains (1 — d)r k+l while it may 
lose something as well.) 

One way to show that the expected increment in total weight is negative show that 
u(v) < for each infected node. Trying to count this way is inefficient because d must 
be kept small so that a node that is completely surrounded by infection does not seem 
to increase the total weight if it recovers. A solution is to let Y be another parameter to 
be determined later, and define 

U(v) = u(v) + 9 1 (v)Y/r - 9 2 (v)Y. (4) 

The sum over all nodes of U(v)r k is the same as the sum of u(v)r k , because for each parent 
and child node that are both infected, a quantity of r k Y has merely been transferred from 
parent to child. The C/'s are calibrated so as no longer to penalize our strategy at nodes 
with many infected neighbors. 

Now there are 2n + 2 cases to check in the forward equation, one for each value of 
9\(v) and 9 2 (v). For each pair of these values, r, d, A and Y must be chosen to make 
U < 0. Again by linearity, it suffices to consider the cases where 9 2 (v) = or n, so the 
inequalities are: 

-1 + Afr" 1 -d + nr(l - d)} < 
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-1 +nrd + A(r _1 - d) - nY < 
-1 + d + \nr{\ -d) + Fr" 1 < 
-1 + d + nrd + Yr' 1 - nY < 0. 

These inequalities are not too hard to solve. Given n, r, and A, equality in the first 
equation gives an optimal choice for d and equality in the third equation gives an optimal 
choice for Y. It then turns out that equality in both of the remaining equations can be 
achieved for the greatest A when r = 1/ y/n, and hence the strict inequalities are satisfied 
for any smaller A. Plugging in these values 

gives a nasty looking quadratic in A that simplifies to 

-(1 - v^) 2 A 2 + (Ay/n - 2)A - 2 < 

and has the solution appearing in (iv) (divide through by 1 — y/n and use the quadratic 
formula). Values for A, r, d and Y for n = 2,3,4 are given below so that skeptics may 
plug them into the four equations above and check that they actually work. For n = 2 
use A = .561722, r = .7071, d = .434212 and Y = .082293; for n = 3 use A = .425516, 
r = .5773, d = .407781 and Y = .089933; for n = 4 use A = .354246, r = .5012, d = 
.391747 and Y = .088349. 

When all the inequalities are satisfied, it is easy to see, as in the proof of (ii), that in 
fact E(W(£(t + dt)) | f (t)) < (1 - e)W(f (t)) for a suitable e > 0. Then again the expected 
total weight decays exponentially, hence the probability of finding an infection in any 
finite set decays exponentially and the proof is done. □ 

iyi): The 2 comes from the fact that the critical value for A on the one-dimensional 
integer lattice is know to be at most 2 and is an upper bound on A2 by monotonicity. 
For the nontrivial bound, first record two lemmas. 
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Lemma 2.3 Let M be an positive integer valued random variable and pick p < EM 
(with EM possibly being infinite). For any x > 0, let M x be a binomial(M,x) random 
variable. Then there is an e > for which P(M X > 1) > px A e. 

Proof: Pass to the bounded case as follows: let L be large enough so that E(M A L) > p; 
let M' = MM and M' x = binomial ( M', x); then P(M X > 1) > P(M X > 1) and it suffices 
to prove the lemma for M'. Now 

P(M' X > 2) < P(binomial(L, x) > 2) = 1 - (1 - x) L - Lx{l - xf' 1 = 0(x 2 ). 

Also, EM; < P(M; = 1) + LP(M^ > 2) and thus 

P(M' X > 1) > P(M' X = 1) > EM^ - LP(M; > 2) > px 

for small enough x which proves the lemma. □ 

Lemma 2.4 Let H be any nondecreasing function on the nonnegative reals with H(x) > 
x on some neighborhood of 0. Suppose f is a function on the nonnegative reals that 
satisfies 

(1) inf < t < L f(t) > and 

(2) f(t) > #(inf < s <i_ L /( S )) fort> L 

for some L > 0. Then liminf^oo f(t) > 0. 

Proof: For any t and any e > 0, (2) implies there is a sequence ti,t 2 ,...,tfc with 
f(k) > - e2- > fit^) - e2~* and t k E [0, L\. Then f(t ) > f(t k ) - e which 

is bounded away from for small e by (1). □ 

Continuing the proof of (vi), let f(t) = Ps(p G ^(t)) in an altered contact process 
where only n of the root's n + 1 neighbors is allowed to infect the root. The object will be 
to get a recursion f(t) > H(f(t — 2r)). Here 2r is a convenient size for a time increment 
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to be chosen later so as to make H monotone increasing with H(0) = and H'(0) > 1. 
Then Lemma 2.4 with L = 2r will imply that liminf^ 00 p(t) > 0. 

To get this recursion break down f(t) into a product of conditional probabilities as 
follows. Let 

Hx(t) = P(v G £(t — r) for some child v of p) 



and 



H 2 (t) = P(p G | f G £(t — r) for some child t> of p). 



Then f(t) = H 1 (t)H 2 (t). Clearly a lower bound on H 2 that does not depend on t is the 
probability p(r) = P x (y G C( r )) m a system with only two neighboring nodes x and y. 
To calculate this, let Q(s) = P x (y G ^(s) | x G ^(u) for all u < s). Then (5(0) = and Q 
satisfies the differential equation Q'(s) = —Q(s) + (1 — X)Q(s) and solving the equation 
gives 



Q(s) 



A 



1 + A 



'I 



-(l+A)^^ 



(5) 



Obtain from this a crude lower bound on p(r) by requiring x to stay infect up to time r: 



H 2 > p(r) > Q(r)P x (x G ^(s) for all s < r)e r Q(r). 



(6) 



A better bound is given by conditioning on the recovery time s of x, requiring y to be 
infected at time s and remain infected through time r: 

H 2 > p(r) 

> Q(r)e~ r + / P x (x is infected to precisely time s)Q(s)e s ~ r ds 



Q(r)+ f Q{s)ds 
Jo 



A 

1 + A* 



1 _ e -(l+A)r + 



1 + A 



1 _ e -(i+A>) 
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Now for a lower bound on H\ , let M be the number of children of p that are infected at 
time r, not counting the one child that is not allowed to infect p. The previous calculation 
says essentially that each child is infected at time r with probability at least p(r), so 
EM > np{r). If each of these nodes is not allowed to be infected by its parent between 
time r and time t — r then by definition of / such a node infected at time r is infected at 
time t — r with probability f(t — 2r). Thus the actual number of children infected at time 
t — r stochastically dominates a random variable that is binomial(M, f(t — 2r)). Then by 
Lemma 2.3, H^t) > np(r)f(t — 2r) A e for some e > and so f(t) > np(r) 2 f(t - 2r) A e. 
Now writing H(x) > np(r) 2 x A e it remains only to show that H'(0) > 0, for then 
Lemma 2.4 will finish the proof. 

For the first claim in (vi) use the crude lower bound (6) on p(r) to get 
y/H'(0) = V^Pir) 

> ^{YTx) e ~ r{1 - e ~ {1+X)r) 

This holds for any r, and an optimal choice of r is ln(2), giving tJh'(0) > Ay / n/4(l + A), 
hence H'(0) > 1 if A > A/{y/n - 4). 

To get the asymptotic, use the better bound to get 

As n — > oo the minimum A satisfying this inequality will have y/n\ — > e r r~ l and choosing 
r = 1 gives limsup ^/n\ = e. □ 



15 



(vii): First an outline of the argument. By duality, it suffices to show that when 
A > A 2 and A > (3 + \/8n + l)/(2n — 2), a process that lives forever hits any big set. By 
duality again, this means that the process started from an infection at the root and run 
for a long time intersects an independent process initially infected on a sufficiently big 
set and run for a long time. This will be true because each of these processes will at some 
time infect a particular large configuration if it lives long enough (this uses A > A 2 ) and 
the configuration is chosen so that any future scenarios started from that configuration 
intersect with high probability. These two facts become the following two lemmas. 

Lemma 2.5 Suppose A > A 2 . Then for any finite set S and any initial configuration r\, 
Pj/'S' Q £(t) for arbitrarily large t) = infection lives forever). 

Proof: Assume without loss of generality that 77 is finite; an easy limit argument handles 
the infinite case. Fix A > A 2 . The first observation is that for any node v, P v (v G 
£(£) for arbitrarily large t) — p > 0, where P„ denotes P{ v }, the contact process run from 
only v infected initially. This is immediate from the definition of A 2 , since if P„(sup{£ : 
v G < 00) = 1 then P v (v G £(t) for some t>T)^0asT^oo contradicting 

lim sup T P^, (t> G £(T)) > 0. Next observe that P V {S C for arbitrarily large t\ v G 
£(t) for arbitrarily large t) — 1. This follows from the Markov property and the fact that 
P(<S' C £(t + 1) I v G £(£)) is bounded away from zero for a fixed finite S, fixed v G T and 
any initial state of the contact process. 

The last step in proving the lemma is to show that if the process lives forever then 
with probability 1 there is some v that is infected at an unbounded set of times. Assume 
to the contrary. Then there is some configuration rj' such that conditional upon the 
process being in configuration rj' at time t, the probability is greater than 1 — p that the 
process lives for ever but no node is infected at unbounded times. This contradicts the 
definition of p above, since 7/ must have at least one infected node, which is then infected 
at arbitrarily large future times with probability at least p. □ 
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Lemma 2.6 Define a version of the contact process starting with a single infection at 

the root, where no vertex may infect its parent. Let £(t) and £'(t) be independent such 

processes, started simultaneously. Then P(£(i) H £'(t) 7^ for all t) > 0, provided A > 
3 + V8n + 1 
2n-2 

Proof: The argument will show that the set of nodes simultaneously infected in £ and 
£' contains a supercritical branching process. This process Z will be coupled to £ and 
£' and for each v e T the set (t : « e Z(t)} wil be a possibly empty interval called the 
lifetine of t> . The set of nodes that are ever alive is a branching process, oriented by the 
orientation of the rooted tree. Specifically, let p be the single node of Z alive at time 
zero. Let each node v of Z die whenever v recovers in £ or Thus lifetimes in the Z 
process end at poisson rate 2. Dead nodes may not come back to life. A node v in the Z 
process is born at time t when the following conditions are met. The parent of v must be 
alive in the Z process, since some time to < t. The node v must be infected in both £ and 
having just received an attempted infection from its parent in one of them at time t 
(it is OK if the node was already infected when it received the attempt). Furthermore, 
if £' is the process in which the infection occurs at time t, then the most recent infection 
of v in the £ process must have been after time to; similarly if the infection at time t 
occurs in the £ process, then the most recent infection of v in the £' process must have 
been after time t . 

Whether a node is ever alive in the Z process depends on: whether the parent was 
ever alive; the parent's lifetime, t\ — t ; and the attempted infections and recoveries at 
that node in the £ and £' processes during the interval [t ,ti). Since the distribution of 
attempted infections in an interval depends on the interval only through its length, and 
since lifetimes are i.i.d., Z is a Galton- Watson process. The mean number of offspring is 
easy to calculate exactly. The probability that v is ever alive given that the parent is born 
at time t is just the probability that v receives an attempted infection from its parent in 
both £ and £' before its parent recovers. This is just 2A/(2A + 2) times A/(A + 2), the 
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first of these being the probability that even one attempt arrives in time, and the second 
being the race between the other attempt and the recovery of the parent. The branching 
process is supercritical if n times this quantity is greater than 1, which happens when 
n\ 2 > A 2 + 3 A + 2, and solving the inequality gives the condition in the statement of the 
lemma. Finally note that Z(t) C f (i) n £'(*), hence £(t) n f'(t) ^ for all t when Z lives 
forever. □ 



Lemma 2.7 Let r] be the configuration where the infected nodes are those at distance 
precisely r from the root, for some r > 0. Let A be as large as in Lemma 2.6, and let £ 
and £' be two independent versions of the contact process run from initial state rj. Then 
for any e > 0, r can be chosen large enough so that inf Sjt P(£(s) fl £'(£) ^ 0) > 1 — e. 

Proof: By duality P(f(s)nf'(*) 7^ 0) = P(f((s + t)/2) nf ((s + t)/2) ^ 0), so it suffices to 
consider the case s = t. By monotonicity, the processes dominates altered versions where 
no node may infect its parent. These processes proceed independently on the subtrees 
below each of the (n + l)n r_1 nodes in 77, and on each subtree £ and £' are versions of the 
process described in Lemma 2.6. By independence and the conclusion of Lemma 2.6, r 
can be chosen large enough so that £ and £' intersect for all time on at least one of the 
subtrees with probability > 1 — e. □ 

Proof of (vii): Convergence in distribution is determined by finite-dimensional marginals, 
which by inclusion-exclusion are determined by the probabilities of infection existing in 
each finite set. So it suffices to show for finite sets S C T that P J? (5' fl £(£) 7^ 0) — > 
T^S* fl £ 7^ 0) times the survival probability of 77. By duality, this is just the product of 
the survival probabilities of S and 77. 

Let e > be arbitrary and choose r as in Lemma 2.7. Let £ and £' be independent 
versions of the contact process run from initial states S and 77 respectively. Let 

t = inf{t : t) G ((f) for all v with d(p,v) = r} 
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a = 



inf{t : v G £'(t) for all v with d(p,v) = r}. 



Let £" and be independent contact processes run from the initial configuration in 
Lemma 2.7, coupled to £ and £' respectively so that C £(i + r) whenever r < oo 
and C'(t) C f (* + <t) whenver a < oo. Then P(Sn£(T) ^ 0) = P(f (T/2) nf (T/2) ^ 0) 
by duality, 

= // P(a eds)P(r edt)P(£(T/2)n£(T/2) ^(&\a = s,t = t) 

J Joo>s,t>0 

> ff P(ffGrfs)P(r6rfi)P(('(T/2)n((T/2) ^ | a = s, r = t) 

J JT/2>s,t>0 

> f f P(a G ds)P(T G dt)P(C(T/2 -a)D C(T/2 - t) ± 0) 

J JT/2>s,t>0 

> ff P((7 € ds)P(r G df)(l -e) 

7 JT/2>s,t>0 

= (l-e)P((7,r<r/2). 

The last expression converges to (1 — e) times the product of the survival probabilities 
from 5 and 77 according to Lemma 2.5, since a and r are independent. Since e is arbitrary, 
this shows liminf t P V (S n 7^ 0) is at least the product of the survival probabilities. 
Duality gives trivially that limsup t P v (Sr\£(t) 7^ 0) is at most the product of the survival 
probabilities, and the theorem is proved. □ 



3 Nonhomogeneous trees 

The upper and lower bounds Theorem 2.2 gives for the critical values of A 2 on homoge- 
neous trees are within a constant factor. Since the constant may be quite large, it was 
reasonable to get some more accurate values for small n. In the case of nonhomogeneous 
trees, more work is needed just to get the bounds within a constant asymptotic factor, 
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as shown by the example in the following paragraph. This section is therefore devoted to 
improving the bounds so that they are again within a constant factor of each other. This 
section does not discuss bounds on Ai per se, but when the trees are nonhomogeneous 
enough, the better upper bounds on A2 get pushed down by more than a constant factor 
below the upper bounds on Ai that can be obtained using the methods of the previous 
section, which means that the correct value of Ai in these cases is also more than a con- 
stant factor lower than can be gotten by the methods of the previous section. In fact, 
the last example of this section shows that Ai and A 2 may both be zero on a Galton- 
Watson tree whose generating function has all moments. For ease of exposition, large 
constants are chosen with reckless abandon, leaving the interested reader the exercise of 
determining better constants. 

To see that there is indeed a problem, try to apply the methods of the previous section 
to the following tree. The root has n children, each of which has only one child. Each 
of the n grandchildren has n children, and in general the number of children alternates 
between 1 and n along any direct lineage. 

For an upper bound on A 2 , adapt the method used to prove (iv). Let f(t) = P(p G 
£(£)) as before, and let g(t) = Pt'(p e where T' is the subtree rooted at a child 
of the root of T. Then the same argument as before gives f(t) > H(g(t — 2r)) and 
g(t) > J(f(t — 2r)) where H is the same as in the proof of (iv) and J is similar but 
based on only 1 child. Then monotonicity of H gives f(t) > (H o J)(f(t — 4r)). Now 
since (Ho J)'(O) = H'(0)J'(0), Lemma 2.4 together with equation (6) imply that A > A 2 
whenever the quantity Q(r) in equation (5) satisfies e~ Ar Q(r) 4 ■ 1 • n > 1. This means A 
must be of order n -1 / 4 as n gets large. Thus if G is the geometric mean of n and 1, an 
upper bound on the second critical value for A is 

A 2 < en -1 / 4 = cVG. (7) 

For trees with longer periodicity, the same argument yields A 2 < c\fG where G is the 
geometric mean of the whole sequence of family sizes. 
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On the other hand, the somewhat ad hoc methods in (Hi) and (Hi') can also be 
adapted to this tree to give lower bounds on A2. The reader is invited to check that 
the best simple weighting scheme along the lines of (Hi) for trees whose generation sizes 
alternate between a and b has the weight of a vertex with a children equal to y/a times 
the weight of each of its children, and the weight of a vertex with b children equal to \fb 
times the weight of each of its children (this scheme is not optimal when the periodicity 
of the generations is three or more). Under this weighting scheme, the total weight goes 
to zero almost surely whenever A < l/(y/a + Vb). When a = 1 and b — n, this gives the 
lower bound 

1/(1 + Vn) < K < A 2 . (8) 

Thus the bounds in (7) and (8) have different asymptotics as n — > 00, one decreasing as 
rT 1 ^ and the other as n -1 / 2 , in contrast to the case where the tree was homogeneous 
and the bounds were always within a constant factor. 

Which of these asymptotics for A2 is sharp, if either? The somewhat surprising answer 
is that (8) is sharp, even though (7) is clearly a better representative for the growth rate 
of the tree. In fact for reasonably regular nonhomogeneous trees, the critical value of A 2 
is determined by M = the (essential) supremum of the number of children of a vertex 
and is at most a constant times rM" 1 / 2 where r is a logarithmic measure of how far apart 
vertices with M children are from each other. 

There are several kinds of nonhomogeneous trees one might wish to study. The 
periodic trees mentioned earlier in this section are one such class and Galton- Watson 
trees are another. Theorem 4.1 below will be used to obtain results about both of these 
cases. Since the proofs for Galton- Watson trees are harder, these will be given first and 
a brief version of the other will follow. 

Let T be a Galton- Watson tree with generating function / = J2n>o a nX n , i.e. there 
is a root p and each vertex v has a random number C(v) of children, where C(v) are 
independent and each is equal to n with probability a n . Assume f'(l) > 1 so that the 
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tree is infinite with probability p > 0. To run the contact process on a Galton- Watson 
tree that begins with only the root infected, choose T(u;) according to the Galton- Watson 
distribution and let £(t,u \ T(cu)) be the contact process run on the tree T(cu) starting 
with only the root infected. 

Proposition 3.1 For a given offspring generating function f , let 

q(T) = liminf P(p G f(t) | T(u) = T). 
Then either q is almost surely zero or else P(q > | |T| = oo) = 1. 

Proof: Let p± — P(q — 0). If the root has n children with subtrees Ti, . . . , T„ then the 
Tj are independent, identically distributed with distribution independent of n. Further- 
more, if q(Tj) > for any % then q(T) > 0. To see this use the same argument as in (vi) 
of the last section for any fixed r to get 

P(p G m) > e- 2r Q 2 PT t (P G ^(t - 2r)). 

Taking liminf's over t gives q(T) > a constant times ?(Tj) for each %. Now by indepen- 
dence of the subtrees Tj, p\ = P(g(T) ^ 0) < J2n a n{pi) n = f(Pi)- That means that 
either p\ — 1 or pi is bounded above by the unique fixed point of / in (0, 1). In the first 
case q is almost surely zero and in the second case P(q = 0) is at most P(|T| < oo); since 
it is also at least P(|T| < oo), these two must be equal and the proposition is proved. □ 

In light of this, it is natural to define A 2 (/) to be the inf of A such that q is almost 
surely nonzero for the contact process with parameter A run on a Galton- Watson tree 
with generating function / conditioned to be infinite. The following upper bound on \ 2 
can now be obtained. 

Theorem 3.2 Let r be the maximum of 2 and c 2 ln(l/na n )/m(/'(l)). There are con- 
stants c 2 and c 3 such that for any n > 1, A 2 < c 3 Jr\n(r) \n(n)/n. 
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The class of periodic trees may be defined in any of a number of ways so that a 
corresponding theorem is true. Here is one possible course broad enough to include the 
example at the beginning of this section. Let G be any finite graph with a root, p, a 
second distinguished vertex, a, and another set of distinguished vertices v±, . . . v m distinct 
from each other. Suppose that for some ji and j 2 a path from a to each Vi of length at 
most j 2 exists that is disjoint from a fixed path of length j 1 connecting a and p. Let n 
be 1 less than the number of neighbors of a and let j = ji + j 2 . Construct a graph that 
will be a m-ary tree of G"s as follows. 

Let T^(G) = G. For each i > 1 let T^ +1 \G) be a copy of G together with m copies 
of TW(G) but with p in the i th copy of T/W identified with Vi in the copy of G, so there 
remains only a single root in T^ +1 \ namely the one from the copy of G. The increasing 
limit of these trees (as seen from the root) is the desired graph T(G). If G is a tree then 
so is T(G). 

Theorem 3.3 For G, n, j, m and T(G) as above, let r be the maximum of 2 and \j / ln(m)] . 
Then there is a constant c 4 such that A 2 < c 4 yV ln(r) ln(n)/n. 

The proofs of Theorems 3.2 and 3.3, will be given in section 5 since it relies on the results 
from section 4 on finite stars. 

4 Finite stars 

Let T be a star of size n, i.e. T consists of a root, p and n other vertices each connected 
only to p. Although any finite system is eventually trapped in the state of zero infection, 
this star turns out to be capable of storing the infection for a long time when A is an 
appropriate constant times 1 / y/n. Since the application of these results is to get the two 
asymptotics is equations (7) and (8) to agree within a constant factor, there is no loss 
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of generality in assuming n is greater than 64, say, so that n -1 / 2 and n~ 1//4 are actually 
distinguishable. It will also be convenient to restrict A to being less than 1; again, nothing 
is lost, since all the critical values are less than 1 for n > 64. 

Before stating the theorems precisely, here is a heuristic explanation of the results. 
Suppose n is large and let A = a/y/n for some constant a. Let x be the number of 
infected neighbors of p divided by \/n and pretend that x is a continuous rather than a 
discrete parameter. When the root is not infected, x decreases deterministically at a rate 
x (i.e. x = e^ t xo) and when the root is infected, x approaches a at rate 1 (i.e. x — a = 
e~*(rr — a)). The root, when infected, recovers at rate 1, and when healthy, is infected 
at rate (a/ y/n)(xy/n) = ax. In other words, the state of infection is a Markov process on 
{0, 1} x [0, a] where the second coordinate is x and the first coordinate is if the root is 
healthy and 1 if the root is infected. Its natural scale can be calculated exactly, giving a 
function W : {0, 1} x [0, a] — > R for which W(t) is a martingale. Writing f(x) = W(l, x) 
and g(x) = W(0,x) yields the equations 

(a-x)f' + (g-f) = 

-xg' + ax(f - g) = 

which have the unique (up to afnne transformation) solution 

/(*) = f e- as (a - s)- 2 ds ; g(x) = f(x) - e~ ax (a - x)' 1 . (9) 
Jo 

From this it is easy to read off the behaviour of x(t) and in particular, to see that for 
k < I < m, the chance of hitting x = a/k before hitting x = a/m, starting from x = a/l 
is on the order g-^C 1 /'- 1 /™), 

Returning to the actual situation, where the number of infections is integral, there 
are roundoff errors in the statement of an analogous result due to the greatest integer 
function, and roundoff errors in the proof due to the fact that the functions / and g 
are no longer very nice. To kill the roundoff error I require n to be larger than some N 
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and choose sequentially a bunch of constants satisfying certain relations. The constants 
involved are named C5 through C13 and are fixed throughout once they are chosen. These 
can be evaluated easily by tracing the argument, but the value of N that this yields is 
far from optimal. 

Theorem 4.1 Let c 9 ,c w ,Cu be constants satisfying 1/4 > l/c 10 + 1/cn > l/c 10 > 
l/c 9 > and let c 5 satisfy l/c 5 < l/c w — l/c 9 . For n a positive integer and a a positive 
real number, run the contact process on a star of size n with A = a/^/n and initially y 

nodes infected, where y = |_— ^— J . Then the conditions 

C10 

n > (1/cio — l/ c 9 — V c 5) 1 an d 4 < a < \fn 

imply that the probability of the number of infected nodes dropping below a\Zn/c 9 before 
reaching y + a^/n/cu is at most e a2//cs . 

Corollary 4.2 If n is large enough then beginning with only p infected at time zero, the 
probability of the number of infected nodes staying above a\/n/c 9 during the time interval 
l<t< e a2 / C5 /2cn is at least e _1 /5. 

Remarks: (1) The important fact here is that the infection lives for a time exponential in 
a 2 with a probability bounded away from zero; this is what creates the order ^J\n(n)/n 
bound in Theorems 3.2 and 3.3, via equation (11) in the next section. (2) As observed 
earlier, requiring n to be large and A = a/ y/n to be less than 1 gives away nothing. 

Proof of corrolary: The probability that p remains infected until at least time 1 is e _1 . 
Remember this value: p\ = e _1 . Conditioned on that, the probability of finding at least 
y nodes infected at time 1 is at least the probability of a binomial with parameters n 
and Q(l) being least y, where Q(l) = {a/ (a + ^fn)){l - e^ 14 ^™)) > (1 - e - 1 )a/2^/n is 
as in equation (5). The mean of this binomial is at least a^/n/A and the variance is at 
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most nQ(l) < an/ (a + y/n) so if n is large enough, the binomial is at least y = a^/n/cio 
with probability arbitrarily close to 1, say 4/5. Remember this too: p 2 = 4/5. Next, 
conditional on all that, Theorem 4.1 implies that the number of times the number of 
neighbors of p goes from y to y + a^/n/ Cn before becoming less than a^/n/ c 9 dominates 
a geometric random variable with mean at least e a '° 5 . Finally, if n is large enough to 
apply Theorem 4.1 then the waiting times for each new infection are independent from 
everything else and are at least exponential with mean 1/a^/n (the infection rate summed 
over all neighbors of the root is at most a^/n). The mean sum of a^/n/cu of these waiting 
times e a2//cs times over is thus at least e a2//c5 /cn. Any sum of a geometric random number 
of i.i.d. exponential random variables is at least half its mean with probability at least 
1/4 if the geometric has large enough mean. So the probability that the sum of the 
waiting times is at least e a2 / C5 /2cn conditional on there being at least e a2//cs transitions 
from y to y + ay/n/cn is at least 1/4. Call this p 3 . Now multiply p±, p 2 and p 3 to give 
the result. □ 

Proof of Theorem 4.1: For each state £, let /(£) be 1 if the root is infected and otherwise. 
Assign each state £ the weight 

where x is the number of infected neighbors of p. This W is not a martingale like 
the heuristic one was, but reducing the exponent from ax/\/n to ax/10\/n makes it 
a supermartingale by, as will be evident, a comfortable margin. Note that by the size 
assumption on a, a/10\/n < .1 so e 

a/HV"_i < i.2a/l(Vn, hence ^/n(e a/loVn -l)/a < .12 
and W is always positive. Let r be the first time that p has at least y + a\/n/cn < ay/n/A 
infected neighbors. The state £(t) is a continuous time Markov chain on the space 
{0, 1} x {0, 1, ... , n}, where £ is represented as the pair (/(£), number of infections). The 
transition rates are: 

(0,x) — > (0, x — 1) at rate x 
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(0,x) 


->(!,*) 


at rate 


ax 
y/n 


(l,x) 


— (0,a;) 


at rate 


1 




(l,x-l) 


at rate 


X 


l,x) — 


(l,x+l) 


at rate 


a 

Jn 



'n — x) 

For W(£(t A r)) to be a super martingale, two inequalities must be satisfied. The transi- 
tions from (0,x) yield the inequality 



ax \/n 



^a/lOV* _i)> x ( e a/10^ _ ^ 



yjn a 

which is satisfied with equality. Setting k = ( v /n/a)(e a / 10v/ ™ — 1), the transitions from 
(1, x) yield 



k + x(l- k){e a/wVTl - 1) < 4=(n - - A;)e- a/10 ^(e a / 10 ^ - 1) 

and dividing out by 1 — k and by e a / 10v/ ™ — 1 gives 

7 '' +x < -^{n-x)e- a ' w ^ 



[1 - k)(e a l w ^ - 1) - y/n y 



x + ^/;; nA - 7\ < ^={n-x)e- a ^ n 

e a/10yn 

e a/loVE y/nx + n , in = < an - ax 

a- jn{e a l w ^ - 1) ~ 



e a/10Vn 



v ^( e a/10VS_ !) 



Since a < ^/n, the left hand side is at most (1 + e 1 l 1 ®)^Jnx, and since a > 4, the right 
hand side is at least an(l — e 1 / 10 /(4 — y / n)(e 2 / 5v/ ™ -1 )) > an/2 (the denominator on the 
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right hand side has a local minimum at either endpoint of [4, y/n\ with the absolute 
minimum at 4). Thus the inequalities are satisfied as long as \.ll^Jnx < an/ 2 which 
is always satisfied when x < a/Ay/n, which is always satisfied up to the stopping time. 
This establishes that W(£(t A r)) is a supermartingale. 

To finish proving the theorem, suppose that p has y infected neighbors at time t, 
where y is as in the statement of the theorem. There is no loss of generality in the 
worst-case assumption that p is not infected at time t. Then W(£(t)) = e~ ay ^ 10 ^ n and 
by definition of y, this is between e - a2/10ci ° and e -a a AOcio+a/"Vn fiut w ^ > e -a 2 /io C9 
the first time the number of infected nodes falls below a^/n/cg. Use the fact that W is 
always positive together with the optional stopping theorem see that W can reach the 
value e -a,2 / 10c9 with probability at most exp{— a 2 /10cio + a/10y/n + a 2 /10cg). This gives 
the desired conclusion as long as a 2 /10cio — a/10y/n — a 2 /10cg > a 2 ' jc^. Factoring out 
a 2 /10 yields the equivalent condition l/c 10 — 1/cg — l/c 5 > l/ay/n and since a is at least 
1, the condition n > (1/cio — l/ c 9 _ l/ c 5) _1 finishes the proof. □ 

One more lemma will be needed for the next section, so record it now. 

Lemma 4.3 Let n, a and A be as in Theorem 4-1 and run the contact process on any 
graph containing the finite tree T where T consists of a star of size n with the root infected 
at time zero, to which has been added a single chain of length r of descendants of some 
child v\ of p, denoted v 2 , . . . , v r . Then 

P(v r is infected before time e a2/c5 /2c n ) > e _1 /5 - (1 - c r \ r f ea ' 2/C5 )A=ur-2 ( 10 ) 
where c is a constant independent of everything else. 

Proof: By monotonicity, it is sufficient to prove the statement for a modified contact 
process in which t> 2 cannot infect v±, so the process restricted to the star of size n is 
unaffected by what happens at v 1 and below. Also by monotonicity assume without loss 
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of generality that the graph is actually equal to T. Let Bt be the event that the number 
of infected nodes of the star at time t is at least a^fn/c§, B = f]{Bt '■ 1 < t < e a2 / C5 /lcn}, 
and let Tt be the a-algebra of events up to time t. Then P(v r is infected at some time s G 
(t, t + r) | J-'t, Bt) is bounded below by the probability that p becomes infected before time 
t + 1 given J-'t, Bt, times the product of the probabilities that Vi infects Vi + 1 before 
becoming uninfected and this happens within time 1. Working out these probabilities 
gives a lower bound of (cA) r for a universal constant c. 

Now let L = + e a2//c5 )/2cnrj — 1, and consider a sequence of times = {1, 1 + 
r, l + 2r, 1 + Lr}. For each such time U, P(v r g £(£) for U < t < U+r \ T tl , B ti ) < l-c r A r , 
and so P(v r £ f (t) for 1 < t < 1+Lr \ B) < (l-c r A r ) L , since each B u G T u and B G fl B^. 
Corollary 4.2 gives P(jB) > e~ 1 /5 and since L is at least as big as the exponent in the 
right hand side of (10) the lemma is proved. □ 



5 Proofs of Theorems 3.2 and 3.3 and examples 

Proof of Theorem 3.2: Assume without loss of generality that a n > (or else r is 
infinite). Also, there is no loss of generality in assuming n to be large enough to apply 
Corollary 4.2, since c 2 and c 3 may be taken large enough so that c 3 ^Jr\n(r) ln(n)/n is 
at least 2 for any n that is too small, and 2 is an upper bound on all critical values on 
trees. By Proposition 3.1, is suffices to prove that liminf^ 00 P(p G > without 
conditioning on T. Let 

g(t) = P(p G £(t)) > a n P(p G £(£) | p has at least n children). 

To calculate g(t), condition on the root having at least n children until further notice. 
Let c be the constant in Lemma 4.3. Treat r as a parameter to be determined later and 
choose a = a(r) so that 

e a2/c 72c n r > 2 + 10(c r (a/ v ^) r )- 1 . (11) 
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Using the inequality (1 — x) 1 ^ < e -1 , observe that (11) forces the right hand side of 
equation (10) to be at least c§ = e _1 /4 — e~ 10 when A is at least aj^fn. To satisfy (11), 
it suffices to choose a > ^/c 5 ln(2c n r ■ 10 • c^ r v /n r ) which is at most c 3 V^ In r Inn for an 
appropriate C3 as long as r > 2. Set A = a/^/n. If this is sufficient to force P(g(T) > 
I |T| = 00) = 1 then A2(/) < c 3 ^Jr\n(r) ln(n)/n as desired. Now the root has at least n 
children; select n of them and ignore the rest (by monotonicity) . Among all the vertices 
at distance r from p, look for those having at least n children; call these vl, . . . , v-j. where 
j is the random cardinality of the set of such descendants. From the facts that p has 
n children, that each descendant has an expected /'(I) children and that a node has n 
children with probability a n , it follows that 

ej > naM'iiyr 1 - 

Now let M be the random number of vertices among . . . , v 3 r that are infected before 
e " 2 /c5^2c n . By the choice of a, Lemma 4.3, and the above expression for Ej, 

EM > c 8 na n (/'(l)) r - 1 . 

Now the argument proceeds as in (iv) of the previous section. Fix t and Let x be the 
minimum value of g(s) for < s < t — e a2//c5 /2cn. Recursively calculate a lower bound 
on g(t) as follows. 

Ignoring all but the first infection of each v l r by its parent, any of these that are infected 
at some time s < t— e a2//c5 /2cn will evolve independently from time s to time t— e a2//c5 /2cn 
and then be infected with probability at least x = mf{g(s) : s < t — e a2//c5 /2cn}. Thus 
the distribution of the number of such nodes infected at time t — e a2//c5 /2cn given M 
dominates a random variable M x that is binomial (M,x). Thus for small enough x, 
P(M X > 1) > x(EM — e) for arbitrarily small e. Finally, if some v\ is infected at time 
t — e a2 / C5 /2cn, then the probability of finding p infected at time t is bounded below 
by the probability, pi, that the contact process starting with only v\ infected at time 
t — e a2 / C5 /2cn infects p at some time s with t — e a2 ^ cs /2cn < s < t times the probability 
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P2 that p is infected at time t given infection of p at some such time s. Then p\ is 
at least c$ by Lemma 4.3 and equation (11) with the star of size n centered at v l r this 
time. Meanwhile, P2 is at least (e~ 1 /5)e~ 1 a 2 /(c$ + a 2 ), which will be called C13. (See 
this by applying Corollary 4.2 together with the fact that the probability of p becoming 
reinfected between times t — 1 and t given at least a^/n/cg infections during this time, 
and then staying reinfected, is at least a 2 /(c 9 + a 2 ).) 

Thus as x — > 0, g(t) is bounded below by CgCi3:rE(M) > CgCi3a;na n (/ / (l)) r_1 . Now 
choose c 2 so that putting r = c 2 m(l/na n )/m(/'(l)) forces CgCi 3 na n (/ / (l)) r ~ 1 to be 
greater than 1. Applying Lemma 2.4 then gives inLg(t) > 0. □ 

Proof of Theorem 3.3: Passing to a subtree if necessary, assume without loss of generality 
that the parent of a is on the unique path of length j\ connecting p to a and that there 
is a unique path connecting a to each Vi and it passes through a child of a, not through 
the parent of a. 

For a positive integer q to be determined later, consider T(G) as being put together 
from units of size instead of from units of size G. Then p has at least n neighbors 
and furthermore it has at least m q descendants at distance at most qj, all of which 
continue recursively the property of having the right number of neighbors and well- 
endowed descendants. 

Now use the argument for the previous theorem with m q instead of a„(/'(l)) r ~ 1 . 



Then a must be at least c^yqj ln(gj) ln(n)/n as before, and the condition needed for 
Lemma 2.4 is that q is that CgG^m 9 > 1. This means that q = c' / ln(m) suffices, and 



A couple of examples conclude this section. The first shows that the the upper bound 
for A2 in Theorem 3.3 can be asymptotically sharp. In fact for this example it is shown 
that Ai is within a constant factor of the upper bound for X 2 as n — > 00, so Theorem 3.3 
provides asymptotically tight bounds on both critical values. 




setting r = qj/d proves the theorem. 



□ 
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EXAMPLE 

This example calculates Ai for a particular tree in order to show that the order y / ln(n)/n 
bound given by Theorem 3.3 may be sharp to within a constant factor. Let T be a 
homogeneous binary tree to every vertex of which has been added n children that have 
no children themselves. Thus each vertex has n + 2 children, only 2 of which have 
descendants. To get a lower bound on Ai, define the weight of a configuration as the 
sum over all nodes that have children of 1 if the node or any of its childless children are 
infected and otherwise. The following partial converse to Corollary 4.2 will be needed. 

Proposition 5.1 Let T be a star of size n with A = aj^Jn < 1 as usual. There is an N 
for which n > N implies that the probability starting from configuration r\ of the infection 
dying out before time ln(n) + 2 (In \r)\ V 4) is at least e~ 4a2 /10. 

Proof: First observe that from any configuration rj, P J? (|^(2(ln |?7| V4))| < 2ay / n) > 3/4. 
To see this, couple the process on T to one on another star of size n, T', where the root 
of T' always remains infected, but none of the children of p' in T' starts out infected. 
The coupling has corresponding children of p and p' recover at the same times and has 
an infection occurring in T' whenever one occurs in T but not necessarily vice versa. 
When every child of p has recovered once, the number of infected children of p is at most 
the number of infected children of p'. This happens by time 2 (In |^| V4) with probability 
(1— e~ 2 ( ln l ?? l v4 ))l r 'l > 7/8. But the number of infected children of p' at any time is a random 
variable converging upwards to a stationary distribution that has mean approximately 
a\fn and for large enough n has probability less than 1/8 of being above 2a^/n. Thus 
the coupling establishes the observation. Now from a configuration where at most 2a\fn 
nodes are infected, it remains to show that the probability of the infection dying out 
within time ln(n) is at least e~ 4a2 /7.5. The probability that no reinfection ever occurs is 
at least (1/(1 + A)) 4a ^™ > (1 - a/y/n) Aa ^ « e- 4fl2 . If no reinfection occurs, the infection 
dies out within time ln(n) with probability at least (l- e - ln ( n )) 2a ^™ = (1-1/ ' n ) 2a ^ > e ~ 2 
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since a < y/n by assumption. Multiplying together the conditional probabilities gives 
e-4a 2 (3/4)/e 2 > e" 4a2 /10. □ 



What is needed below is actually an altered version of this proposition saying that 
even if p is given 3 additional neighbors that are always infected, the probability of the 
infection dying out except on the additional 3 neighbors in the allotted time is still at 
least e~ 4a 7l0. The reader may fill in the details to beef up the proof. 

Going back now to the example, consider any node v that contributes 1 to the weight 
of the configuration either by having an infected child or by being a node with children 
that is itself infected. There are only three neighbors of v that may increase the weight of 
the configuration if they get infected by v, so the expected contribution of v to increasing 
the weight of the configuration in a time period of length 3 ln(n) is at most 

9Aln(n) = 9aln(n) / \fn. 

The probability of v decreasing the weight during this time because the infection dies 
out on v and its childless children is at least e _4a2 /10, by the beefed up version of the 
previous proposition (since \r]\ is necessarily at most n). Now pick a = ^J\n(n)/10. This 
makes 

e - 4a2 /10 = n A /\Qy/n/2 > 90n(ra) ln(n)/l(Vra = 9ahi(n)/y/n 

for large enough n. Thus any node contributes an expected weight that is less than 1 
before it and its childless children all recover. The usual supermartingale argument then 
implies that the infection dies out almost surely and so 

Ax > y/ln(ri)/10y/n (12) 

for large enough n. 

On the other hand, T is a periodic tree: T = T(G) for the graph G with root p = a 
and n + 2 other vertices all connected only to the root, two of which are labelled v 1 and 
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t> 2 . In the notation of section 3, m — 2, j — 1 and what is called n in Theorem 3.3 is 
n + 2. Then Theorem 3.3 gives 

A 2 < c^Vlnn/y/n. (13) 
Thus for this class of trees, both (12) and (13) are sharp to within a constant factor. 
EXAMPLE 

This example is thrown in to show that even medium-sized tails on the generating function 
of a Galton- Watson tree can force the critical values down to zero. For any positive 7 < 1 
let 

n 

be the generating function for the number of children, where c is a normalizing constant. 
Then ln(l/na n ) ~ n 7 , so Theorem 3.2 gives A 2 < c^Jn 1 " 1 ln(ra) which goes to zero as 
n — > 00. Thus A 2 is less than any positive number, hence Ai = A 2 = 0. 



6 Continuity of the survival probability 

Continuity from the right of the survival probability as a function of A is immediate from 
the monotonicity, since versions of the process for each A can all be coupled so that the 
event of survival is continuous from the right in A. At the moment, I can only prove 
continuity from the left when the phase transitions at Ai and at A a are distinct. It is also 
necessary to assume that the tree is homogeneous or periodic. In particular, the survival 
probability is a continuous function of A for homogeneous trees with n > 2. I believe 
that survival probability is continuous for arbitrary trees. 

I will present the argument that \\ < X a implies continuity of the survival probability 
only for homogeneous trees; the generalization to periodic trees is straight-forward but 
tedious. In the foregoing discussions, let p(X) denote the survival probability for the 
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contact process on an n-ary homogeneous tree with only the root infected initially. Begin 
with a counting lemma. 

Definition 6.1 For a finite set S C T, say a vertex v G S is surrounded in S iff each 
of the n + 1 components of T \ {v} intersects S. 

Lemma 6.2 Let S be a finite set of vertices of a homogenous tree, T. Then the number 
of surrounded vertices of S is less than \S\/n. 

Proof: The proof proceeds by repeated reductio ad absurdum. Suppose for a contra- 
diction that the result is false. Let S be a counterexample with as few vertices as 
possible, and among such counterexamples, let S have as small a value as possible for the 
sum J2 v ,wes d(v, w). The induced subgraph on S is just the vertices of S together with 
whichever edges of T connect two vertices in S. 

I claim that the induced subgraph on S must be connected. Suppose not. Let v G T\S 
be a vertex for which T \ v has at least two components intersecting S (the existence of 
such a v is equivalent to S failing to be connected). There are two cases. Firstly, suppose 
more than two components of T \ v intersect S. Writing S = \JSi, where each Si is the 
intersection of S with a single component of T\v , deletion of any single one of the Si from 
S does not affect which of the remaining vertices are surrounded. Then by deleting the 
Si with the smallest fraction of surrounded nodes, a smaller counterexample is obtained, 
which is a contradiction. The other case is that exactly two components of T\v intersect 
S. Let Si and S 2 be the intersection of S with these two components and let Vi and v 2 
be the neighbors of v in the two components. Let be any graph-automorphism of T 
sending t> 2 to v and v to v\. Let S' — Si U 0[5*2]. Then vertices in Si are surrounded in 
S iff they are surrounded in S', and vertices in S 2 are surrounded in S iff their images 
under are surrounded in S'. So S' is a counterexample with as many vertices as S but 
a smaller value J2 v ,weS d(v, w), which is a contradiction. 
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Now that it is established that the induced subgraph on S is connected, count the 
edges of S in two different ways. First, since the induced subgraph is a tree, there is 
one fewer edge than there are vertices. Counting oriented edges doubles this, so there 
are 2\S\ — 2 oriented edges. On the other hand, there are n + 1 oriented edges leading 
out of every surrounded vertex and at least one edge leading out from every vertex, 
so the number of oriented edges is at least j^l + n(number of surrounded vertices), 
thus 2\S\ — 2 > \S\ + n(number of surrounded vertices), and there are less than \S\/n 
surrounded vertices. This is not a counterexample, thus no counterexample exists. □ 

The rest of the discussion deals with the contact process on a tree with a severed edge. 
For each positive integer r, let £^ be a contact process run from an initial infection at 
the root with an edge severed at distance r from the root, i.e. no infection passes over 
that edge. Write £( r;A ) or £( ;A ) when it is necessary to emphasize the choice of A. An 
adjacent edge counts as being at distance one. The main lemma on continuity from the 
left is the following. 

Lemma 6.3 Let T be a homogeneous tree. For a fixed \, let ^ be as above, and suppose 
the survival probability for the infection in £^ is positive for some r. Then the survival 
probability for the unaltered contact process is continuous from the left for that value of 
A. 

Remark: There is a corresponding lemma in percolation theory stating that the perco- 
lation probability is continuous provided that percolation in Z d implies percolation in a 
half-space. 

Proof: First note that r can be taken to be 1, since the initial infection in the £^ process 
has a nonzero chance of moving to a distance r from the severed edge before it dies, thus 
dominating a ^ process a postive fraction of the time. 
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The next claim is that if £^ survives with positive probability at a particular value 
of A then it survives with positive probability for some smaller value A'. To find such a 
A' begin by setting p = p(X) equal to the survival probability for ^ 1;A ^ and pick t large 
enough so that 

P(ie (1;A) WI>10(l + A)/Ap)>3p/4. 

such a t exists, since — > oo almost surely when the infection lives forever. Pick 
A' G [A/2, A) and large enough so that 

P(\^ X '\t)\>5(l + X')/X'p)>p/2. 

This is possible because p/2 < 3p/4, 5(1 + \')/\'p < 10(1 + A)/Ap, and the distribution 
of i^\t) is continuous in A for fixed t. 

For this paragraph, let the infection rate be understood to be A'. Let S C ^ v ' x '\t) 
be the random subset of the infected nodes at time t that are not surrounded in the set 
of infected nodes at time t. Since 1 — l/n > 1/2, Lemma 6.2 shows that S contains at 
least half the nodes infected at time t. Thus 

P(\S\ > 5(1 + A')/2A'p) >p/2. (14) 

For each v G S there is by definition of S some neighbor, say g(v) for which the component 
of T \ v containing g(v) does not intersect ^ v,x '\t). For each such v, the probability is 
A'/(l + A') that v will pass the infection on to g(v) before recovering. Let A be the 
random set of such v. Forbid any infection of nodes in S from time t onward. Then 
the process proceeds independently on each component of T \ S containing some x G A, 
since communication has been severed. Furthermore, the starting configuration on each 
such component is precisely a single infection and the process evolves as ^ v,x '\ since 
an edge next to the initial edge has been severed. Then the £^ process dominates a 
set of independent ^ processes, started from each vertex in A. As in the theory on 
Galton- Watson processes, the survival probability is greater than the fixed point of a 
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generating function for the probabilities of the various cardinalities of A. This must be 
positive if E\A\ > 1. Since E(\A\ | \S\ > 5(1 + X')/2Xp) > 5/2p, the desired result follows 
from equation (14). 

Finally, to show continuity from the left, recall p(X') is the survival probability for 
a severed edge and parameter A', and let q be the survival probability for the unaltered 
contact process with parameter A. Let e > be arbitrary and let M be large enough so 
that (1 — p(X')) M < e/4. Also pick M large enough so that any binomial random variable 
with mean at least M is at least M/2 with probability at least 1 — e/4. Let t be large 
enough so that 

P(|£ (;A) (*)I > 4(1 + A')M/A') >q-e/A 
for the parameter value A, and pick A" with A' < A" < A large enough so that 

P(le (A,,) (OI > 2(1 + \")M/X") >q- e/2. 

Rerun the previous argument but for infection rate A". Let S be the unsurrounded 
infected nodes in £ < - ;A "- ) (£), so each v E S has a neighbor g(v) in a component of T \ S 
with no infection. There is a set A of nodes g(v) for v G S, that are infected before v 
recovers. Formally A is the set of nodes g(v) sucht that v 6 S and there exists a t' > t 
for which v is infected throughout the time interval [t,f] and such that v infects g(y) 
at time i! . The binomial hypothesis on M guarantees that A has cardinality at least M 
with probability at least q — 3e/4, since each g(v) is independently in A with probability 
A"/ (1 + A"). Each node in A defines an independent process which is a restarted version of 
£W with parameter A" > A', so the hypothesis on p(X') and M gives a survival probability 
of at least q — e for the whole process. Since e was arbitrary, the lemma is proved. □ 

It remains to show that the hypothesis of this lemma is satisfied when the phase 
transitions at Ai and A a are distinct. This is done in two steps. 

Lemma 6.4 Suppose the contact process is run on a homogenous tree for some X < X a . 
Then P(v is ever infected) — > as d(p,v) — > oo. 
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Proof: Suppose not. Then by monotonicity it decreases to some bound b > 0. Since 
A < A a the root is infected finitely often almost surely, so there is some time t and 
configuration i] such that P (survival without infecting the root any more \£(t) — rj) > 
1 — b. Since any node alive at time t infects the root with probability at least b, this is a 
contradiction. □ 

Lemma 6.5 Suppose \ < \ a . Then the survival probability for ^ on a homogenaous 
tree approaches the survival probability for the unaltered contact process as r — > oo. 

Proof: Couple all the £^ with the unaltered process, so that they evolve identically until 
an infection in the unaltered process crosses an edge that is severed in the other process. 
By Lemma 6.4 the probability of the processes ever becoming uncoupled goes to zero as 
r goes to infinity, and the result follows. □ 

Putting all these lemmas together gives 

Theorem 6.6 If A x < A a then the survival probability for the contact process from a 
single initial infection on a homogeneous tree is continuous in A. 

Proof: When the survival probability is zero there is nothing to prove. When it is 
positive and A < A a then some £ (r ) also has positive survival probability by Lemma 6.5, 
and continuity follows from Lemma 6.3. For A > A a , the existence of a A between Ai 
and A a implies the survival with positive probability of a severed-edge contact process 
for that and hence all higher A, and Lemma 6.3 again applies. □ 
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